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Abstract 

We present a new randomized algorithm for computing the diameter of a weighted directed 
graph. The algorithm runs in 0(M'^/("+i)n('^''+^)/("+i)) time, where w < 2.376 is the exponent 
of fast matrix multiplication, n is the number of vertices of the graph, and the edge weights are 
integers in {— M, . . . , 0, . . . , M}. For bounded integer weights the running time is 0(n^-^^^) and 
if w = 2 + 0(1) it is 0{n7^^). This is the first algorithm that computes the diameter of an integer 
weighted directed graph polynomially faster than any known All-Pairs Shortest Paths (APSP) 
algorithm. For bounded integer weights, the fastest algorithm for APSP runs in 0(n2-5^5) time 
for the present value of to and runs in 0{n^'^) time if a; = 2 + o(l). 

For directed graphs with positive integer weights in {!,..., M} we obtain a deterministic 
algorithm that computes the diameter in 0{M'rf) time. This extends a simple 0(n'^) algorithm 
for computing the diameter of an unweighted directed graph to the positive integer weighted 
setting and is the first algorithm in this setting whose time complexity matches that of the fastest 
known Diameter algorithm for undirected graphs. 

The diameter algorithms are consequences of a more general result. We construct algorithms 
that for any given integer d, report all ordered pairs of vertices having distance at most d. The 
diameter can therefore be computed using binary search for the smallest d for which all pairs are 
reported. 

1 Introduction 

Computing the diameter and, more generally, computing distances, are among the most fundamental 
algorithmic graph problems. In the Diameter problem we are given a graph and are required to find 
the largest distance between two vertices of the graph. The more general All-Pairs Shortest Paths 
(APSP) problem asks to find distances and shortest paths between all pairs of vertices of the graph. 
Clearly, any algorithm for APSP can be used as an algorithm for Diameter. The converse, however, 
is not necessarily true. 

Unfortunately, at present, we do not know of any algorithm for general weighted graphs that 
solves Diameter asymptotically faster than APSP and the existence of such an algorithm is an open 
problem (see, e.g., [HEIIS]). 

In the case of graphs with arbitrary real edge weights, no truly sub-cubic algorithm for APSP 
or for Diameter is known. The presently fastest algorithm for both is an algorithm of Chan \A\ that 
runs in 0{n^ log^ logn/ log^ n) time, where n in the number of vertices of the graph. For sufficiently 
sparse graphs, the APSP algorithm of Johnson |10] performs better as it runs in 0{mn + n^logn) 
time where m is the number of edges. 
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When the edge weights are integers, fast matrix multiphcation techniques are useful. For undi- 
rected graphs with integer edge weights in {1, . . . , M} an APSP (and thereby a Diameter) algorithm 
of Shoshan and Zwick [13] runs in 0{Mn^]i}\ time, where io < 2.376 is the exponent of matrix mul- 
tiplication. This algorithm generalizes earlier O(n^) algorithms of Seidel and of Galil and Margalit 
for the unweighted case [121 E] • 

The situation becomes more involved in directed graphs, as negative edge weights may be allowed. 
In the presence of weights in {—M, . . . , 0, . . . , M}, the fastest APSP algorithm (and presently fastest 
Diameter algorithm) is by Zwick [17j . It runs in 

O^Mi/{i-uj)^2+i/{4-uj)^^ time, and an additional 
small speedup is obtained if fast rectangular matrix multiplication is used. For bounded M, this 
speedup results in a running time of 0(n^'^^^). Interestingly, Zwick's algorithm is also the fastest 
known algorithm for unweighted APSP in directed graphs. The directed unweighted setting is also 
the only known setting where Diameter has an algorithm that is presently faster than APSP. It was 
observed by Zwick (private communication) and also by Bjorklund (private communication) that the 
Diameter of a directed unweighted graph can be computed in 0{n^) time using a repeated squaring 
argument of the adjacency matrix, combined with binary search. This approach, however, does not 
directly extend to the weighted setting even when the weights are positive integers in {1, ... , M}. 

The main results of this paper present the first algorithm(s) for Diameter that are polynomially 
faster than APSP in integer weighted directed graphs. Our first algorithm computes the diameter 
of a directed graph with integer edge weights in {— M, . . . , 0, . . . , M} polynomially faster than the 
aforementioned APSP algorithm of Zwick. Our second algorithm computes the diameter of a directed 
graph with integer edge weights in {1, . . . , M} even faster. The time complexity we obtain in this 
case matches the time complexity of the aforementioned APSP algorithm of Shoshan and Zwick that 
applies to the undirected setting. 

In fact, both algorithms are, respectively, consequences of two other algorithms that solve a 
related problem that can be viewed as more general than Diameter. 

The problem we are considering is Threshold APSP where, in addition to the input graph we 
receive a threshold value d. The goal is to report all ordered pairs of vertices having distance 
at most d. Any algorithm for Threshold APSP can be converted into an algorithm for Diameter 
by applying Threshold APSP a logarithmic number of times. Indeed, since our edge weights are 
integers in {—M, . . . , 0, . . . , M} the diameter is either oo or is an integer in {0, . . . , M{n — 1)}H. 
Using binary search one can locate the diameter which is the smallest d such that Threshold APSP 
reports all ordered pairs of vertices. Observe that Threshold APSP is an interesting problem on its 
own. Especially interesting are the cases d = and d = — 1 in the negative weight setting. The case 
d = can be used to find all pairs that have positive distance and the case d = —1 can be used to 
find all pairs that have negative distance. Using two consecutive values of d one can also locate all 
pairs having any specific given distance. Our main results are, therefore, algorithms for Threshold 
APSP. 

Theorem 1.1 Let G = {V,E) be directed graph with n vertices and having integer edge weights 
in {— M, . . . , 0, . . . , M} and let d be an integer. Then, with high probability, the set of all ordered 
pairs of vertices having distance at most d can be computed in 0{M^^^^^^^n^'^ time. In 

particular, the diameter of G can be computed in 0{M^/^^'^^^n^^ +3)/(t^+i)^ time. 

^Throughout this paper, 0{f{n)) stands for f{n)n°''^K 

^ As usual, we assume that the input graph has non negative weight cycles. This can also be verified initially using 
known algorithms such as [16] in running times that are faster than the claimed running time of our algorithm. 
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For uj = 2.376 and for bounded edge weights, the running time of our algorithm is 0(n^'^^^). It is, 
therefore, polynomially faster than the 0(n^'^'^^) algorithm of Zwick. An even larger gap exists if 
UJ = 2 + 0(1), as may turn out to be the case. In this case, Zwick's algorithm, as well as an earlier 
algorithm of Alon, Galil, and Margalit |2j, run in 0{n?'^) time. It is, thus, plausible that ^(n^'^) is 
a barrier for APSP in directed graphs. Our algorithm, on the other hand, computes the diameter in 
0(n''/^) time, assuming oj = 2 + o(l). Our result thus intensifies the plausibility of a true complexity 
gap between Diameter and APSP in weighted directed graphs. 

For unbounded edge weights, our algorithm is also faster than Zwick's 0{M^^^^^^^n'^^^^^'^^^^) 
algorithm (also if fast rectangular matrix multiplication is used in the latter) as long as M < n^^"^. 
But observe that for M > n^^^ both algorithms become cubic and it is preferable to use Chan's 
general APSP algorithm in this case. 

The main idea of our algorithm is as follows. We start by computing values that capture the 
actual distance between all pairs that are "far apart" (pairs whose distance is realized only by a path 
that has relatively many edges). This is done using the well-known "long shortest path" technique. 
The more difficult case is to capture distances connecting pairs that are not far apart. For this 
purpose we use an appropriately computed redundant partial distance matrix that is a generalization 
of partial distance matrices introduced in |16) . A scaled version of this matrix enables us to obtain 
a good additive approximation for the distance between such pairs. If the approximated value is 
sufficiently smaller than the threshold d then we already know that such pairs must be reported. For 
pairs whose approximation is close to d we can use a truncated and shifted version of the redundant 
partial distance matrix in order to compute their distance precisely. 

We now state our result for the positive integer weighted case. 

Theorem 1.2 Let G = iV^E) he directed graph with n vertices and having integer edge weights in 
{1, . . . ,M} and let d he an integer. Then, the set of all ordered pairs of vertices having distance 
at most d can he computed in 0{Mn^) time. In particular, the diameter of G can he computed in 
d{Mn'^) time. 

Similar to other shortest paths algorithms, both of our algorithms for Threshold APSP can 
actually construct a path data structure. For two vertices u, v reported as having distance at most 
d, we can generate an actual path from u to v whose total weight is at most d in time that is 
proportional to the number of edges of the generated path. Observe, however, that for the purpose 
of computing the diameter, we do not even need to construct such a data structure. Once we have 
computed the diameter value d, we can use the results of Threshold APSP with the threshold d — 1 
in order to locate all pairs of vertices that realize the diameter precisely. These are all pairs reported 
for the threshold d and not reported for the threshold d — 1. Now, if we pick any such pair u, v, we 
can perform a Single Source Shortest Paths computation from u in 0{Mn^) time (see, e.g. [16^ lllj) 
and obtain an actual path that realizes the diameter. 

The rest of this paper is organized as follows. The next section contains some preliminary 
definitions, notations, and known results that are required for the proof of the theorems. Section 
3 describes redundant partial distance matrices that are an important ingredient in the proof of 
Theorem 11.11 Section 4 contains the proof of Theorem 11.11 and Section 5 contains the proof of 
Theorem II. 2 [ The final section contains some concluding remarks. 
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2 Preliminaries 



Let T{1, m, n) be the minimal number of algebraic operations needed to compute the product of an 
I X m matrix by an m x n matrix. 

Definition 2.1 (Matrix multiplication exponents) Let uj{r,s,t) be the infimum of all the ex- 
ponents oj' for which T {n''' , , n*) = 0{n^ ). We let uj = a;(l, 1, 1) he the exponent of square matrix 
multiplication. 

Theorem 2.2 (Coppersmith and Winograd [7J) uj < 2.376. 

In fact, Coppersmith [6j proved that if a is the supremum over all constants r for which uj{1, r, 1) = 2 
then a > 0.294. The following lemmas are obtained by decomposing a given matrix product into 
smaller products. (See, e.g., Huang and Pan [9j.) 

r 2 ifO<r<a, 

Lemma 2.3 a;(l,r, 1) < < o i ^-2/ \ t ^ ^ a 
^ ' ' ^—12 + — a) ij a < r < 1. 

Lemma 2.4 a;(l, r, r) = u;{r, r,l) < 1 + (u — l)r, for < r < 1. 

Definition 2.5 (Distance products) Let A be an i x m matrix, and let B he a m x n matrix. 
Their £ xn distance product C = A-k B is defined as follows: C[i, j] = mm'^^^{ A[i, k] + B[k,j]}, for 
1 < i < i and 1 < j < n. 

It is easy to see that if is an n x n matrix containing the edge weights of an n- vertex graph, 
then W^, the n-th power of W with respect to distance products, is the distance matrix of the graph 
(recall that we assume no negative weight cycles). Namely, W"'[i,j] = S{i,j) where S(i,j) denotes 
the distance from i to j. 

As the fast algebraic matrix multiplication algorithms rely heavily on the ability to perform 
subtractions, they cannot be used directly for the computation of distance products. Nevertheless, 
we can get the following result, first stated by Alon et al. [2], following a related idea of Yuval [T5] . 

Lemma 2.6 Let A be an x matrix and let B he an x n* matrix, both with elements taken from 
{— M, . . . , 0, . . . , M} U {+oo}. Then, the distance product Ai^ B can be computed in 0(Mn'^('''*'*)) 
time. 

Definition 2.7 (Truncation) If D is a matrix and t is a positive integer, let {D)t he the matrix 
obtained from D by replacing all the entries that are larger than t or smaller than —t by +oo. In 
other words, {D)t[i,j] = D[i,j], if\D[i,j]\ < t, and {D)t[i,j] = +oo, otherwise. 

Zwick [17] used several novel ideas combining truncated distance products, the notion of bridging 
sets, and fast matrix multiplication, to obtain the presently fastest APSP algorithm in dense directed 
graphs with integer edge weights. 

Theorem 2.8 (Zwick |17] ) Let G he a directed graph with n vertices with integer edge weights in 
{— M, . . . , 0, . . . , M}. Let M = n*. There is an algorithm that computes the distance matrix of G 
in ©(n^"*"^^*^) time, where fi = //(t) satisfies the equation 1) = 1 + 2^ — t. In particular, the 

algorithm runs in 0(n^'^'^^) time if M is bounded. 
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Definition 2.9 (Partial distance matrix) A partial distance matrix of a graph G is a matrix P 
such that P -k P is the distance matrix of G. 

Generalizing the above result of Zwick, it was shown by the author and Zwick how a partial distance 
matrix can be computed much faster. Observe that once a partial distance matrix is computed, the 
distance between any given pair of vertices can be computed in 0{n) time. 

Theorem 2.10 (Yuster and Zwick |16j ) Let G be a directed graph having n vertices and integer 
edge weights in {— M, . . . , 0, . . . , M}. There exists an algorithm that computes a partial distance 
matrix of G in 0{Mn^) time. 

3 Redundant partial distance matrices 

For two vertices u,v oi a graph, let c(n, v) denote the smallest number of edges in a path that realizes 
5{u, v). 

Definition 3.1 ((/?, 7)-redundant partial distance matrix) Let G = {V,E) be a directed graph. 
For nonnegative parameters /3,7 with 0</3 + 7<l we say that a matrix P whose rows and columns 
are indexed by V is a 7) -redundant partial distance matrix ({/3,'y)-RPDM) if the following holds: 

1. For each pair u,v with c{u, v) < n^~^ there exists some x £ V such that P[u, x] + P[x, v] = 
5{u,v). Furthermore: 

2. There exists a path p^^v with c{u,v) edges realizing 5{u,v) such that each segment of p^^v 
consisting of \n^~^~'^^^ edges contains a vertex x such that P[u^x\ + P[x,v] = 5{u,v). 

The algorithm given in Figure [J computes a (/3, 7)-RPDM of a directed graph G = {V,E) with 
integer edge weights in {— M, . . . , 0, . . . , M} whose adjacency matrix is given as the input parameter 
W (non-edges represented by +00 in W). For subsets of vertices X and Y, the notation y] 
appearing in the algorithm denotes the sub-matrix of P consisting of the rows X and columns Y. 
For matrices R and S with the same dimensions, the notation R < — S denotes the the assignment 
to R of the matrix whose entry [u,v] is the minimum of R[u,v] and S'[n, u]. The algorithm in 
Figure [U is a modification of the algorithm from jl6j for computing partial distance matrices. In 
fact, the first for loop (consisting of \log^/2{''^^~^~'^)'\ iterations) is identical to the algorithm in 
|16j . The second for loop differs from the first one in that B remains constant and is no longer 
decreased by sampling. We next prove the correctness and compute the running time of the algorithm 
redundant-partial-distance- matrix. 

Lemma 3.2 With high probability (at least 1 — 0{logn/n)), redundant-partial-distance-matrix 

correctly computes a {f3,'y)-RPDM. 

Proof. The first part of the proof is identical to the proof in [16] . Let B£ denote the random subset 
B of the £-th iteration (observe that when £ < [log3/2('^^~'^~''^)l '^^ are in the first for loop and 
otherwise we are in the second for loop). We first note that Lemma 4.1 of |16j remains intact: If 
i £ Bi or j G Bi, and there is a shortest path from i to j in G that uses at most (3/2)^ edges, then 
after the ^-th iteration, with high probability (at least l — 0(log n/n)) we have P[i,j] = (recall 
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algorithm redundant-partial-distance-matrix(W„xn! l) 

y ^{l,2,...,n} 
B ] P 

for £ ^ 1 to riog3/2(ni-/5-T)l 
. ^ (3/2)^ 

B ^ sample(i?, (9n In n) / s) 

P[V^B] ^ {P[V,B])sM * {P[B,B]),M 
P[B,V] {P[B,B])sM * {P[B,V])sM 
endf or 

for i ^ riog3/2(ni-^-^)l + 1 to riog3/2 (27^1-/5)1 
. ^ (3/2)^ 

P[V,B] {P[V,B])sM * {P[B,B])sM 

P[B,V] {P[B,B])sM * {P[B,V])sM 
endf or 
return P 



Figure 1: Computing a (/3, 7)-redundant partial distance matrix. 



that the only difference between our algorithm and the algorithm from [16] is that we stop sampling 
from B after the end of the first for loop; this is of course to our advantage since the probability of 
hitting a path with a larger sample is larger). 

Let u and v be two vertices with c{u, v) < , and let pu,v be a path with c{u, v) edges from u 
to V realizing 5{u,v). To establish the first part in the definition of a (/3,7)-RPDM, we only need to 
show that at the end of the algorithm there is, with high probability, some x € i?^ on Pu.v This is 
identical to the proof of Lemma 4.2 in [T6] (as shown there, such an x already appears in B^ where 
(3/2)^ > c{u,v)). But in order to satisfy the second part in the definition of a (/3,7)-RPDM we need 
to prove something stronger: we must show that, with high probability, any segment of v}~l^~'^ edges 
of pu,v contains some x & Bi at the end of the algorithm. As Bi remains the same after the last 
iteration of the first for loop, we must prove that a random subset of size 9nlnn/n^~^~'^ vertices 
hits every such segment of pu^v Indeed, the probability that no vertex of Bf hits a specific segment 
is at most 

(1 - ginn/n^-'^-^)"'"''"^ < n'^ . 

As there are less than n segments to consider in Pu,v, and as there are less than pairs of vertices 
u,v to consider, we have that with high probability (larger than 1 — 0(logn/n)), the set Bg at the 
end of the first for loop (and hence also at the end of the algorithm; it is the same set) hits every 
segment of n^~^~'^ edges of each pu^^ with c{u,v) < n^~^. ■ 

Lemma 3.3 redundant-partial-distance-matrix runs in time. 



6 



Proof. There are a logarithmic number of iterations, and the most time consuming operation in each 
iteration is the computation of a distance product. By Lemma [2T6l a distance product in the first 
for loop can be computed in 0{sM x T{n,n/s,n/s)) time. By Lemma 12.41 multiplying an n x n/s 
matrix with an n/s x n/s matrix requires 0{s{n/s)'^) operations. Hence, the time to perform a single 
distance product is 0{Ms'^{n/s)^). Since u) > 2, this is never larger than 0{Mn^) (this is the same 
argument that is used in [16j to show that their algorithm runs in 0{Mn^) time). By Lemma 12.61 a 
distance product in the second for loop can be computed in 0{sMn'^^^'^~^'^'^~^'^^). This is maximized 
in the last iteration where s = Q{n^~^). By Lemma 12.41 this amounts to 0{Mn^~^^'^~'^^^^~^"'^~^"'). ■ 



4 Proof of Theorem 11.11 

Let G = {V, E) be a directed graph with n vertices and with integer edge weights taken from 
{— M, . . . , 0, . . . , Af}. Recall that G is assumed to contain no negative weight cycles. Let d be any 
integer and let D = {{u,v) : S{u,v) < d}. Our goal is to construct the set D. Observe that if 
d < —nM then D = (otherwise there are negative cycles). Also, if d > nM then D is precisely the 
set of pairs {u, v) with 6{u, v) < oo and in this case D can trivially be obtained from the transitive 
closure of the unweighted version of G. Hence we will assume that \d\ < nM. 



4.1 Retrieving distances that are realized by long paths 

The first part of our algorithm computes the distances between all pairs of vertices for which c{u, v) 
is large. More precisely, let /3 be a chosen such that 



n 



P = M~n^^^ (1) 



and let t = n^ ^. We compute, for each ordered pair of vertices {u, v), and with very high probability, 
a value St{u,v) which satisfies St{u,v) > 6{u,v) and if c{u,v) > t then 5t{u,v) = 5{u,v). 

Lemma 4.1 There is an algorithm that computes, with probability at least 1 — 0(l/n), values 5t{u, v) 
for each ordered pair of vertices {u,v) such that 6t{u,v) > 5{u,v) and ifc{u,v) > t then 5t{u,v) = 
6{u,v). The algorithm runs in 0(n^+^ + Mn'^) time. 

Proof. Our algorithm uses the well-known "long shortest path" method to compute the distances 
between pairs whose distance is realized only by paths with at least t edges. The idea behind this 
simple method is that a random large subset X CV hits all of these "long" shortest paths. 
More formally, let 

C = {{u,v) : c{u,v) > t}. 

For each pair in C, fix a shortest path p^^^ from u to v. Each such path contains at least t + 1 vertices 
(including endpoints). Let X be a random subset of 8nlnn/t vertices. For a pair {u,v) G C, what 
is the probability that no element of X lies on Pu,v'^ Clearly, this probability is at most 



n J \ 1^ J \ n- J 



As \C\ < r? is follows by the union bound that with high probability (at least 1 — 0(l/n)), X 
intersects each p^^^ for (n, v) G C . 
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So, assume that X is a set of 8nlnn/t vertices intersecting all Pu^v for (u, v) € C. For each x € X 
our next goal is to compute, using a single-source (SSSP) algorithm, all the distances 6{x, v) and all 
the distances 5{v, x) for all v G V. 

Unfortunately, G has negative edge weights so performing \X\ applications of SSSP is too costly. 
As observed by Johnson [10], by an appropriate reweighing, we can settle for just one application of 
SSSP and then reduce the problem to SSSP in a graph with non-negative edge weights. Johnson's 
reweighing consists of running a single application of SSSP from a new vertex, denoted by r, connected 
with directed edges of weight from r to each vertex of V. An 0{Mn^) time SSSP algorithm for 
directed graphs with integer weights in {— M, . . . , 0, . . . , M} was obtained in |1H I16j. It follows that 
the required reweighing of G can be obtained in 0[Mn^) time. The reweighing consists of assigning 
vertex weights h{v) for each v &V (these are the distances from r to v after applying SSSP from r). 
The new weight, denoted by 'w+{u,v) is just w{u,v) + h{u) — h{v) > 0. It now suffices to compute 
SSSP from each vertex of X in G (and similarly in its edge-reversed version). This, in turn, can be 
performed in 0{n'^) time for each vertex of X, using Dijkstra's algorithm. 

Altogether, the running time required to obtain all of the distances 5(x, v) and 5{v, x) for all 
X G X and v is 

6{n^\X\ + Mn'^) = d{n^/t + Mn'^) = d{n'^+'^ + Mn'^) . 
Next, for each ordered pair of vertices {u,v), we compute 

5t(u, v) = min5(n, x) -\- 6(x, v) . 

Observe that the right hand side of the last equation is either infinity or a weight of some walk from 
u to V and thereby 5t{u,v) > S{u,v). But for pairs {u,v) € G the property of X guarantees that, in 
fact, 5t{u,v) = 5{u,v), as required. 

The running time to obtain all of the values 6t(u, v) is 0(n^|X|) < 0{n'^~^^). The overall running 
time of the algorithm is therefore 0{n^^^ + Mn^), as claimed. ■ 



4.2 Obtaining a good additive approximation 

The second part of our algorithm computes approximate distances between all pairs of vertices for 
which c{u, v) is relatively small. This process consists of a logarithmic number of steps, where each 
step guarantees to approximate distances 5{u,v) for a specific range of c{u,v). More precisely, for 
z = 0, . . . , [(1 — /3) log nj , let tj and /3i be defined by 

21 

and observe that /3o = (3 and to = t. Step i computes, for each ordered pair of vertices, and with 
very high probability, a value 5*{u,v) satisfying 5*{u,v) > 6{u,v) and if tj/2 < c{u,v) < U then 
5*{u,v) < 5{u,v) +2ki for a suitably chosen approximation parameter ki. Observe that for any pair 
{u,v) with c{u,v) < t, there exists some i for which tj/2 < c{u,v) < ti. 
Let 7i be defined by 

' ,1- 



n 



and observe that n^'"'"'''' < n so /3i -|- 7^ < 1. Since f3o = (3 we also define 7 



70- 
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algorithm additive-approximate(Pj, 7^, k^) 

X ^ sample(y, I2n^~'^' logn) 
Qi^Ri[V,X] * R^[X,V] 
return kiQi 



Figure 2: Obtaining an additive approximation. 

Let Pi be a (/3j, 7i)-RPDM computed by the algorithm in Section 3. We use Pi to obtain the 
claimed additive approximation. 

Algorithm additive-approximate in Figure [2] accepts Pi and 7^ as input, as well as an approx- 
imation parameter ki. It returns a matrix such that with high probability, its entry [u,v] is close to 
5{u,v) whenever ti/2 < c{u,v) < U. 

Lemma 4.2 With probability 1 — 0{l/n), the matrix kiQi returned by additive-approximate 

has the property that for each ordered pair of vertices {u,v) we have kiQi[u,v] > 5{u,v) and if 
< c{u,v) < ti then kiQi[u,v] < 5{u,v) -\- 2ki. Using ki = Mn}~^^~'^'- the running time of 
additive-approximate is 0(n'>^«+'^(i'i~T^»'i)). 

Proof. Consider an ordered pair of vertices {u,v) for which ti/2 < c{u,v) < ti = By the 

definition of Pi, we have that there exists a path pu,v with c{u,v) edges realizing 5{u,v) where each 
segment of edges of pu^v contains a vertex x such that P[u,x\ + P[x,v] = 6{u,v). In 

particular, there are at least 

c{u,v) ^ tj _ 

such vertices x. This, in turn, implies that a random subset X of 12n^~'''' log n vertices contains, with 
high probability (at least 1 — 0(l/n)) at least one such x for each pair {u, v) with ti/2 < c{u, v) < ti. 
This means that, w.h.p., the entry [u, v] of the product Pi[V, X] V] contains the precise value 

of S{u,v) for such a pair {u,v). Performing this distance product is, however, costly. Instead, we 
divide each finite element in Pi by the number ki. More precisely, the line Ri ^ Pi/ki denotes that 
Ri[u,v] = [Pi[n, uJ/Zcj] for all elements of P. We have 

Pi[u,x] + Pi[x,v] < ki{Ri[u,x] + Ri[x,v]) < Pi[u,x] + Pi[x,v] + 2ki 

and in particular, for all {u,v) with ti/2 < c{u,v) < ti, 

6{u,v) < kiQi[u,v] < 5{u,v) + 2ki . 

Notice that for other pairs, the value kiQi\u,v\ is at least 6{u,v) since the entries in Ri are rounded 
up. 

When applying additive-approximate we will use ki = Mn^^^'-^''\ Observe that the finite 
entries in Pj have value 0{Mri}~l^^) and hence the finite entries in Ri have value 0{ri'^). The 
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running time of additive-approximate is dominated by the distance product of an n x |X| sub- 
matrix of R with an \X\ x n sub-matrix of R. Hence, by Lemma 12.61 and since \X\ = 0{n^~^^), we 
have that additive-approximate runs in 0{n'^^~^'^^^'^~'^^'^^) time. ■ 

4.3 Targeting pairs having distance at most d 

The final part of our algorithm correctly reports all pairs {u,v) with 6{u,v) < d. In the previous 
parts of our algorithm we have computed, for each pair {u,v), values 6t{u,v) and 6*{u,v) for i = 
0,..., [(1 -/3)lognJ. Let 

[(l-r) lognj 

6*{u,v) = mm{5t{u,v) , min 5*{u,v)} . 

1=0 

The following lemma is a consequence of these computed values. 

Lemma 4.3 For any pair {u,v) we have 5{u,v) < d*{u,v) < 6{u,v) + 2Mv}~^~"' . 

Proof. Recall that all computed values are either infinite or upper bounds of weights of some 
walks from u to u and therefore 6{u,v) < 5*{u,v). Now, if c{u,v) > t then 6t{u,v) = 6{u,v). 
Otherwise, there is some i such that ti/2 < c{u,v) < ti in which case 6*{u,v) < 6{u,v) + 2ki. Since 
ki = Mn^~^'-~'^' the claim follows once we observe that /? + 7 < /3j + 7i. Indeed this holds since by 

m 

■ 

By Lemma 14.31 any pair {u,v) with 5*{u,v) < d is reported as having 6{u,v) < d, as required. 
Similarly, any pair with 6*{u,v) > d + 2Mn^~^~'^ is reported as having 6{u,v) > d, as required. So 
we remain with the following set of pairs 

C* = {{u, v) ■.d<6*{u,v) <d + 2Mn^-^-^} 

where we must determine, for each (u, v) G C*, whether or not 5{u, v) < d. In fact, we will determine 
6{u,v) precisely for all {u,v) G C*. 

Let Ci denote the set of pairs in C* for which ti/2 < c{u,v) < ti. Observe that we do not 
know Ci and, furthermore, there may be pairs {u, v) G C* that are in no Ci as it may be that 
6*{u,v) = 5t{u,v). Consider again the 7i)-RPDM matrix Pi. By the definition of Pi, for each 
{u,v) G Ci, there is a shortest path pu^v from u to v with c{u,v) edges such that every segment of 
|"j^i-ft-7i"| edges of pu,v contains a vertex x such that Pi[u, x] + Pi[x, v] = 6{u, v). In particular, there 
is such an x where 

P^[u,x],P,[x,v] G [^^^ - Mn^-ft-^' , + Mn'-^^-^^] . 

But for pairs {u, v) G C* we have, in particular, that 

Pi[u,x],Pi[x,v] G [^-2Mni-^-^, ^ + 2Mn^-^-^] . 
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algorithm target-distances(Pj, /3, 7, d) 

for {u,v) E V X V 

if Pi[u,v] G - 2Mn^-^-^ , | + 2Afni-^-T] 

Si[u,v] ^ Pi[u,v] - Li - 2Mni-^-TJ 
else 

^^[li, w] ^ — hoo 

return Ri + 2l^- 2Mn^-^~^\ 



Figure 3: Computing distances for pairs in Ci. 

Let Si be the matrix obtained from Pi by replacing each entry not in — 2Mn^~^~^ , ^ + 
2Mn^~^~'^'] with +00, and by decreasing each remaining entry by [f - 2Mn^-^-^\ . The distance 
product Ri = Si* Si has, therefore, the property that for each {u, v) G Ci, 

Ri[u,v] + 21'^- 2Mn^-^~^\ = 6{u, v) . 

This procedure is summarized in Algorithm target-distances given in Figure El As each entry of 
Si has value 0{Mn^~^~'^) the distance product Si * Si takes 0{Mn^^^~'^~^'^) time. The following 
lemma proves the correctness of our algorithm. 

Lemma 4.4 For each {u, v) € C* we have 

[(l-r)lognj ^ 

5{u,v)= m.m{5t {u,v) , min Ri[u,v]+2y-- 2Mn^-'^-^\ } . 

i=0 2 

Furthermore, all the values 5{u,v) for {u,v) G C* are computed in 0{Mn^~^~'^~^'^) time. 

Proof. Let {u,v) G C*. If c(u,u) > t then we already have 5t{u,v) = 5{u,v) as required. Otherwise, 
for some i, ti/2 < c{u, v) < ti in which case we have shown that algorithm target-distances applied 
to Pi gives 

Ri [u, v]+2[^- 2Mn^-^-^\ = 5{u, v) . 

Algorithm target-distances is applied O(logn) times, once for each Pj. As a single application of 
target-distances runs in 0{Mn^''l^~"'~^^) time, the claim regarding the running time follows. ■ 

4.4 Running time 

It remains to show that all ingredients of our algorithm run in 0{M'^/^'^^^^rS^'^^^^^^^^^'>) time. 

The first part of our algorithm, given as Lemma |4.H runs in 0{n^+P + Mv}^) time. Recall that 
we can assume that M < n^~'^ (as otherwise the usual cubic algorithms are faster, as mentioned in 
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the introduction) so by ([T]), Mn^ = 0{'n?^^) and also 

Hence this part of the algorithm satisfies the claimed running time assertion. 

Computing a 7j)-RPDM (and recall that we perform this O(logn) times) takes 0{Mn^ + 
jy^^2+(a;-2)(/3i+7i)+7i^ time, as shown by Lemma 13.31 Using ([2]) it is straightforward to verify that 

7Vfn2+("-2)(ft+7>)+7« = Mn'^n^^"'^')/" . 

As /3i > /3o = /3 the worst case occurs when i = 0. Plugging in the value of from ([1]) we obtain 
that 

Hence this does not exceed the claimed running time assertion. 

The next part is algorithm additive-approximate whose running time is 0{n'^^~^'^^^'^~"'^'^^) as 
stated in Lemma |4.2[ We can use Lemma |2.3| but even if we use the naive bound a;(l, 1 — 7^, 1) < 
2 + (w — 2)(1 — 7j) we can see that this part is not a bottleneck of the algorithm. Indeed, since 
7i < 7o = 7 we have 

^7>+'^(l,l-7i,l) < ^7»+2+(a;-2)(l-7,) ^ ^{3~uJ)'r^+uJ ^ ^{3~iu)'y+uj ^ ^(3-<^)ii^(l-/3)+a; ^ 



uj^-6lj^+16lj-9 (3-uj){lj-1) 

n M "+1 < n ^^+1 



where the last inequality is valid for all a; < 3 (and is a strict inequality for 2 < a; < 3). 

The final part is algorithm target-distances given as Lemma |4.4[ It runs in 0{Mn^~^~'^'^'^) 
time. Plugging in the values from ([T|) and ([2]) we obtain 

We have established the claimed running time of our algorithm, thereby concluding the proof of 
Theorem II. 1[ ■ 



5 Proof of Theorem 11.21 

For a nonnegative integer k and for a positive integer M we define a set of nonnegative integers 
F(k, M) recursively as follows. 



F{k,M) 



{0,...,k} iik<M + l, 



For example, F(100, 4) = {100, 52, ... , 48, 28, ... , 22, 16, ... , 0}. Observe that F{k, M) consists of 
0(log(A; + M)) intervals of consecutive integers. 
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Lemma 5.1 \F{k,M)\ = 0{M log k). 

Proof. Assume that k > 3M otherwise the claim clearly holds. The j'th level of the recursion 
defining F{k, M) consists of unions of sets F{i, M) where i is contained in the interval [k/2^ — M — 
1, k/2^ + M + 1]. As there are 0{log{M + k)) levels of recursion, the claim follows. ■ 

For a directed graph G = {V,E) with edge weights in {!,..., M} let Ak denote the Boolean 
matrix whose rows and columns are indexed by V, and ^^[u, = 1 if and only if S{u,v) < k. Also 
define to be the identity matrix. For a given threshold value d our goal is to compute A^- 

Our algorithm will, in fact, compute all the matrices Ai for i £ F{d,M). In particular, as 
d € F{d,M), we will eventually obtain the required A^. 

For i G F{d, M) we say that i is primal \ii < M + \. We say that i belongs to level j if F{i, M) 
appears in the j'th level of the recursion (observe that i may belong to more than one level). We 
denote by L{d,M,j) the interval of consecutive integers forming the j'th level. As observed in the 
proof of Lemma EH L{d,M,j) C [d/2^ - M -l,d/2^ + M + 1] so in particular \L{d,M,j)\ < 2M + 3. 
For example, consider again F(100, 4). Then 100 is at level 0. Level 1 consists of {48, . . . , 52}. Level 
2 consists of {22, . . . , 28}. Level 3 consists of {9, . . . , 16}. Level 4 consists of {2, . . . , 10}. Level 5 
consists of {1, . . . , 7} (recursion continues only for i > M + 1 so for i > 6 in this example). Level 6 
consists of {1, . . . , 6}. Finally, level 7 consists of {1, . . . , 5} and is the last level as it consists only of 
primal values. 

The following lemma essentially proves Theorem 11.21 

Lemma 5.2 Given all the matrices Ai for i € L{d, M, j + 1)U {0, . . . , M + 1} the set of all matrices 
Ak for k G L{d,M,j) can be computed in 0{Mn^) time. 

Proof. Consider some matrix A^ for k G L(d, M, j). If k < M + 1 there is nothing to prove so 
assume that k > M + 1. Suppose that u and v are vertices such that 5{u,v) < k, and consider a 
shortest path from u to v. Let w be the first vertex on this path for which 6{u,w) > \_{k — M)/2j. 
If there is no such vertex, then already 6{u,v) < [{k — M)/2\. Otherwise, since each weight is in 
{1, . . . , M}, then 6{u, w) < [{k- M)/2\ + M - 1. Furthermore, 5{w, v) < k - [{k - M) /2j . In any 
case Ak is just the Boolean or of the following Boolean Matrix products: 

By the definition of F{d,M), if k £ L{d,M,j) then all the indices i and A; — i in ([3]) belong to 
L{d, M,j + 1). Since Boolean matrix multiplication can be performed in 0{n'^) time, ^ shows that 
we can compute A^ in 0{Mn'^) time. 

The only problem that remains is that we do not want to compute a single A^. We want all 
Ak for aU k G L{d,M,j) and as \L{d,M,j)\ = 0{M) this takes 0{M'^n'^) if we compute each Aj. 
separately. 

To overcome this problem we use the following matrix convolution idea. We construct a single 
matrix B that encodes all the matrices Ai for i G L(d,M,j + 1) at once. Set s = \L{d,M,j + 1)| 
and let t be the smallest index in \L{d,M,j + 1)|. Recalling that L(d,M,j + 1) is an interval of 
consecutive integers we have L{d, M,j + 1) = {t,t + 1, . . . ,t + s — 1}. Let B[u, v] be the following 
polynomial of degree at most s — 1. 

s-l 

B[u,v] =^Aq+t[u,v]x'i . 

9=0 
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Thus, the coefficient of x'^ encodes the matrix ^g+j. 

Now consider C = (product performed over the ring of polynomials in a single variable). Each 
element of C is therefore a polynomial of degree at most 2s — 2. Consider the coefficient of a;^"^* in 
C[ii, u]. The only way it can be non-zero is if for some i, AiAk_i is nonzero in entry [u,v\. Hence, 
we obtain A^^ just by examining the coefficients of x^~'^^ in the entries of C . 

The only thing that remains is to consider the complexity of computing C. Two matrices whose 
entries are polynomials of degree at most s — 1 and whose coefficients are bounded integers (in our 
case the coefficients are either and 1) can be multiplied in 0{sn^) time. The standard trick is to 
replace the variable x with a large number (say n + 1 if the dimension of the matrices is n) so that no 
carry is introduced when reading the product entries as digits in base n + 1, and thereby constructing 
the polynomials in the entries of the product. However, observe that replacing the variable x with 
n + 1 causes the entries to become as large as 0{n'') and hence consist of O(slogn) bits. Thus, each 
matrix operation incurs an 0(s)-factor that cannot be ignored. 

In our case we have s = 0{M) so we conclude that all the matrices A^ for k E L(d,M,j) can be 
computed in 0{Mn^) time. ■ 

The proof of Theorem 11.21 now follows from Lemma 15.21 bv recalling that the number of levels is 
0{log{d-\-M)). The only thing that remains is to compute the matrices Ai, ... Am+i that correspond 
to the primal indices. This, however, is relatively easy to do using distance products. In fact, the 
following lemma is a consequence of the result of Zwick from [17J. 

Lemma 5.3 Let G be a graph with n vertices and integer edge weights in {1, . . . , M}. There is an 
0{Mn'^) time algorithm that computes 6{u,v) for all pairs {u,v) which satisfy 6{u,v) < M + 1. 

Proof. Any pair that has 5{u,v) < M + 1 has at most M + 1 edges on any shortest path from u to 
V. The result of [17], specifically, algorithm rand-short-path, its complexity analysis, and Lemma 
4.2 therein, show that the exact distances for such pairs is computed in 0{Mn^) time. We note that 
during the computations of the distance products in each iteration of rand- short- path we never 
need to consider matrix entries with values exceeding M+1. As the bridging sets of each iteration are 
of size 0{n/ s) (see Lemma 4.2 in [17J), the time to compute the distance product of the rectangular 
matrices, even without resorting to fast rectangular matrix multiplications, is 0{M{n/ s)'^ s'^). As in 
our case we always have s = 0{M), the result follows. ■ 
Observe that once we have 5{u, v) for all pairs {u, v) with 6{u, v) < M + 1 then we immediately 
have the matrices Ai, . . . ,Am+i, as required. This concludes the proof that all A^ for k € F{d,M) 
are computed in 0{Mn^) time, and hence the proof of Theorem 11.21 

6 Concluding remarks 

We presented an algorithm that computes the diameter (and Threshold APSP) of an integer weighted 
directed graph polynomially faster than any presently known APSP algorithm. The algorithm is 
randomized and returns, with high probability, the precise diameter, as well as all pairs of vertices 
realizing it. 

Obtaining a truly subcubic algorithm for computing the diameter (moreover Threshold APSP) of 
real-weighted graphs remains an open problem. The prospects in this case, however, seem gloomier. 
It is likely that Diameter and APSP are equivalent under sub-cubic reductions. A recent result 
of Vassilevska Williams and Williams [H] asserts that the existence of a truly subcubic algorithm 
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for real-weighted APSP is equivalent under subcubic reductions to the existence of truly subcubic 
algorithms for a list of problems that seem "lighter" than APSP. 
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